Learning Unitary Operators with Help From u(n)
نویسندگان
چکیده
A major challenge in the training of recurrent neural networks is the so-called vanishing or exploding gradient problem. The use of a norm-preserving transition operator can address this issue, but parametrization is challenging. In this work we focus on unitary operators and describe a parametrization using the Lie algebra u(n) associated with the Lie group U(n) of n× n unitary matrices. The exponential map provides a correspondence between these spaces, and allows us to define a unitary matrix using n real coefficients relative to a basis of the Lie algebra. The parametrization is closed under additive updates of these coefficients, and thus provides a simple space in which to do gradient descent. We demonstrate the effectiveness of this parametrization on the problem of learning arbitrary unitary operators, comparing to several baselines and outperforming a recently-proposed lower-dimensional parametrization. This suggests a route to generalising a recently-proposed unitary recurrent neural network to arbitrary unitary matrices, solving a problem the well-known long short-term memory network was invented to address, but with a simplified and elegant network architecture.
منابع مشابه
The residual spectrum of $U(n,n)$; contribution from Borel subgroups
In this paper we study the residual spectrum of the quasi-split unitary group $G=U(n,n)$ defined over a number field $F$, coming from the Borel subgroups, $L_{dis}^2(G(F)backslash G(Bbb A))_T$. Due to lack of information on the local results, that is, the image of the local intertwining operators of the principal series, our results are incomplete. However, we describe a conjec...
متن کاملA note on $lambda$-Aluthge transforms of operators
Let $A=U|A|$ be the polar decomposition of an operator $A$ on a Hilbert space $mathscr{H}$ and $lambdain(0,1)$. The $lambda$-Aluthge transform of $A$ is defined by $tilde{A}_lambda:=|A|^lambda U|A|^{1-lambda}$. In this paper we show that emph{i}) when $mathscr{N}(|A|)=0$, $A$ is self-adjoint if and only if so is $tilde{A}_lambda$ for some $lambdaneq{1over2}$. Also $A$ is self adjoint if and onl...
متن کاملComputing Wiener and hyper–Wiener indices of unitary Cayley graphs
The unitary Cayley graph Xn has vertex set Zn = {0, 1,…, n-1} and vertices u and v are adjacent, if gcd(uv, n) = 1. In [A. Ilić, The energy of unitary Cayley graphs, Linear Algebra Appl. 431 (2009) 1881–1889], the energy of unitary Cayley graphs is computed. In this paper the Wiener and hyperWiener index of Xn is computed.
متن کاملGeneralized Weighted Composition Operators From Logarithmic Bloch Type Spaces to $ n $'th Weighted Type Spaces
Let $ mathcal{H}(mathbb{D}) $ denote the space of analytic functions on the open unit disc $mathbb{D}$. For a weight $mu$ and a nonnegative integer $n$, the $n$'th weighted type space $ mathcal{W}_mu ^{(n)} $ is the space of all $fin mathcal{H}(mathbb{D}) $ such that $sup_{zin mathbb{D}}mu(z)left|f^{(n)}(z)right|begin{align*}left|f right|_{mathcal{W}_...
متن کاملSpectral Properties of Random Unitary Band Matrices
i It is a pleasure to thank Professor Stoiciu for his patience and selflessness in being my thesis advisor. Second, I would like to thank Professor Silva for being my second reader and for his helpful suggestions. Finally, I would like to thank my parents, without whom I could never have completed this thesis. One of the most important results in analysis is the Spectral Theorem, which shows th...
متن کامل